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Abstract 

Investigations of inverse statistics (a concept borrowed from turbulence) in stock 
markets, exemplified with filtered Dow Jones Industrial Average, S&P 500, and 
NASDAQ, have uncovered a novel stylized fact that the distribution of exit time 
follows a power law p{r p ) ~ r~ a with a « 1.5 at large r p and the optimal investment 
horizon r* scales as p 7 [1,2,3]. We have performed an extensive analysis based on 
unfiltered daily indices and stock prices and high-frequency (5-min) records as well in 
the markets all over the world. Our analysis confirms that the power-law distribution 
of the exit time with an exponent of about a = 1.5 is universal for all the data sets 
analyzed. In addition, all data sets show that the power-law scaling in the optimal 
investment horizon holds, but with idiosyncratic exponent. Specifically, 7 ~ 1.5 for 
the daily data in most of the developed stock markets and the five-minute high- 
frequency data, while the 7 values of the daily indexes and stock prices in emerging 
markets are significantly less than 1.5. We show that there is of little chance that 
this discrepancy in 7 stems from the difference of record sizes in the two kinds of 
stock markets. 
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1 Introduction 



Econophysics is an interdisciplinary science which applies statistical physics 
and complex system theories to economics [4,5,6,7]. More than ten stylized 
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facts of asset returns have been discovered or re-discovered in the community 
[8], some of which are inspired originally by the analogy between finance mar- 
kets and turbulence [9,10]. In these works, the asset return, as the counterpart 
of velocity difference in turbulence, plays a central role, which is defined as the 
difference of logarithmic prices at a given time lag. Recently, a new stylized 
fact have been unveiled dealing with the inverse statistics of the exit time in 
the Dow Jones Industrial Average [1,2,3] and in the foreign exchange markets 
[11]. Interestingly, this concept of inverse statistics was also borrowed from 
turbulence [12] and applied in turbulence extensively [13,14,15,16,17,18]. 



For a given series of log prices {sj} where i corresponds to trading days, the 
exit time (or first passage time) r at time i for a given return threshold p > 
is defined as the minimal time span needed for the difference of log prices 
exceeds p for the first time. In other words, one says mathematically 

t p = inf{j - % : sj - Si > p, j > i}. (1) 

We see that r p > 1 is integer. If the stock price rises, r p = 1. It is argued that 
more small r p in a period implies a bullish market while large t p indicates a 
lasting bearish market. For fractional Brownian motion of Hurst exponent H, 
Ding and Yang [19] have found that the distribution density p(r p ) scales as 

P(r P ) ~ r p ~ a (2) 

with a = 2 — H , when t p is large. For Brownian motion, the exit time distri- 
bution has been solved analytically [20,21,22] 

p ^ = HTTT-, exp[-p 2 /4AV p ] , (3) 

where K is the generalized diffusion constant. The most probable exit time r* 
(also called the optimal investment horizon in Finance) is thus scaled as 

r p * = 2p 2 /3 . (4) 



The first passage time was studied in physics, biology and engineering [20,21,22, 
and references therein]. When applying to financial markets, the probability 
distributions of the exit time of the filtered DJIA follow a power law as ex- 
pressed by Eq. 2 with a = 1.5 [1,2,3] and the foreign exchange rates show 
a = 2.4 [11]. For the filtered DJIA, S&P 500 and NASDAQ, the optimal 
investment horizon scales against p as 

r* P - P 1 , (5) 
with 7 = 1.8, remarkably different from 2 [1]. 
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In this paper, we shall perform an extensive investigation of the inverse statis- 
tics in the stock markets. The data sets consist of daily stock indexes and 
stock prices from different markets and high-frequency (five minutes) data as 
well. We don't filter the data so that it is more rational to refer to p as the 
return threshold. 



2 Probability distribution of exit time 

The probability distribution of the DJIA has been studied and the scaling 
exponent is determined to be a = 1.5 [1,2]. To further explore the inverse 
statistics of the DJIA, we investigated the daily prices of 37 stocks that are 
or were components of DJIA since 1962. The empirical probability densities 
p(r p ) were obtained with the aim of Gaussian kernel smoothing estimation 
[23]. The empirical probability distributions of the exit time for all 37 stocks 
at three different levels p = 0.00954, 0.0596, and 0.244 are shown in Fig. 1 
with different line types. For a given p, the DJIA components have similar 
shapes in their probability distributions. This figure also shows the evolution 
of the distribution with respect to p. The power-law scaling is clearly visible 
for large r p . We find that the exponent of the power law is also a ~ 1.5, in 
agreement with that of the DJIA. 




Fig. 1. The empirical probability distributions of the exit time of daily prices of 
37 stocks that are or were components of DJIA since 1962 at three different levels 
p = 0.00954, 0.0596, and 0.244. 

We also performed analysis on the daily evolution of the S&P 500 index (from 
1940/11/29 to 2004/09/21, totally 16115 data points) and of the NASDAQ 
index (from 1971/02/05 to 2002/11/15, totally 8025 data points) in the USA 
stock markets. The power-law scaling distribution of the exit time with an 
exponent of about 1.5 is confirmed for both indexes, as reported in [2,3]. 
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A question arises naturally asking whether this new statistical property holds 
in other stock markets other than the USA. To address this question, we have 
carried out the same analysis on the following 40 indexes all over the world: 
Argentina (from 1996/10/08 to 2004/09/20), Australia (from 1984/08/03 to 
2004/09/21), Austria (from 1986/01/08 to 2002/11/15), Belgium (from 1991/01/02 
to 2002/1 1 / 18) , Brazil (from 1993 /04 /27 to 2004 /09 /20) , Canada (from 1982 /0 1/29 
to 2002/11/15), Chile (from 1997/06/09 to 2002/10/14), Czech (from 1997/07/01 
to 2004/09/20), Denmark (from 1989/12/04 to 2002/11/15), Egypt (from 
1997/07/02 to 2004/09/20), France (from 1987/07/09 to 2002/11/15), Ger- 
many (from 1959/10/01 to 2002/11/15), Hong Kong (from 1969/11/24 to 
2004/09/21), India (from 1997/07/01 to 2004/09/20), Indonesia (from 1997/07/01 
to 2004/09/20), Ireland (from 1987/03/09 to 2002/11/15), Israel (from 1992/10/08 
to 2004/09/20), Italy (from 1992/12/31 to 2002/11/15), Japan (from 1984/01/04 
to 2004/09/17), Korea (from 1997/07/01 to 2004/09/20), Malaysia (from 
1977/01/03 to 2002/11/15), Mexico (from 1992/01/02 to 2002/11/15), The 
Netherlands (from 1983/01/03 to 2002/11/15), New Zealand (from 1988/08/31 
to 2002/11/15), Norway (from 1987/01/02 to 2002/11/15), Pakistan (from 
1997/07/02 to 2004/09/20), Philippines (from 1997/07/02 to 2004/09/20), 
Russia (from 1997/07/01 to 2004/09/20), SP500 (from 1940/11/29 to 2004/09/21), 
Singapore (from 1985/01/04 to 2002/11/15), Slovakia (from 1997/07/01 to 
2002/10/14), South Africa (from 1995/06/30 to 2002/11/15), Spain (from 
1987/01/05 to 2002/11/15), Sri Lanka (from 1997/07/01 to 2004/09/20), 
Sweden (from 1986/12/18 to 2002/11/15), Switzerland (from 1989/07/03 to 
2002/11/15), Taiwan (from 1997/07/02 to 2004/09/20), Thailand (from 1997/07/02 
to 2004/09/20), Turkey (from 1997/07/01 to 2004/09/20), UK (from 1984/04/02 
to 2004/09/21), Venezuela (from 2000/07/04 to 2002/09/05). We also ob- 
served a power-law distribution of the exit time with an exponent close to 
1.5. 

As a typical example of emerging markets, we present in Fig. 2 the empirical 
probability distributions of the exit time of daily prices of the Shanghai Stock 
Exchange Composite, A Share Index, B Share Index, and 55 stocks listed in 
Shanghai Stock Exchange and Shenzhen Stock Exchange before 1992/12/01 
at three different levels p = 0.00954, 0.0596, and 0.244. One finds again that 
Eq. (4) holds and a ~ 1.5. 

So far, we have checked the validity of the new stylized fact in the daily 
evolutions of indexes and stock prices in both developed markets and emerging 
markets. To further clarify the situation, explorations were carried out on 10 
high-frequency (5 minutes) data sets, including CAC 40 of France (covering 
July, August, September, and October of 2001, totally 9332 data points), S&P 
500 of USA (covering 1995, 1996, and 1997, totally 59885 data points), SSEC of 
China (from 2002/07/01 to 2004/09/16, totally 24955 data points), and seven 
stocks traded in the Shanghai Stock Exchange (six of them from June 2002, 
one from 2002/10/09, and one from 2003/07/25, all up to 2004/09/16). Again, 
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Fig. 2. The empirical probability distributions of the exit time of daily prices of the 
Shanghai Stock Exchange Composite, A Share Index, B Share Index, and 55 stocks 
listed in Shanghai Stock Exchange and Shenzhen Stock Exchange before 1992/12/01 
at three different levels p = 0.00954, 0.0596, and 0.244. 

the scaling law (4) holds and a ~ 1.5. Figure 3 illustrates the distributions of 
the data sets in the Chinese stock market. 




io° io' io 2 io 3 

X 

p 



Fig. 3. The empirical probability distributions of the exit time of high-frequency 
(5 minutes) prices of the Shanghai Stock Exchange Composite and 7 stocks from 
2002/07/01 (or 2002/10/09, or 2003/07/25) to 2004/09/16 in China at three differ- 
ent levels p = 0.000954, 0.00596, and 0.0244. 

We also analyzed the 1-minute-by-l-minute intraday NASDAQ data over 50 
open days in the past from Monday Oct. 30, 2000. We observed a clear 
crossover from one power-law scaling to another separated with a kink around 
r p ps 80. The scaling exponent in the region r* < r p < 80 is ati = 1.1. The 
scaling exponent in the region r p > 80 is hard to estimate due to statistics. 
However, we can see that a 2 of the second region is comparable to 1.5 or ever 
larger. This may signal a possible discrepancy between data sets of sampling 
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frequency higher or lower than 1/min. We shall come back to this in the future 
when longer data sets are available. 



3 Optimal investment horizon 

In the previous section, we verify the finding of a power-law scaling with 
an exponent of about 1.5 [1,2,3] using numerous stock market indexes and 
individual stocks whose recording frequency is daily or 5 min all over the 
world. In this section, we focus on the power law relation (5) between the 
optimal investment horizon r* and the threshold p. We find that this power 
law (5) holds for all the data sets we studied in Section 2 but with quite 
different exponents. 

For daily DJIA (1896-2004), we find that 7 « 1.55, while S&P 500 (1940-2004) 
and NASDAQ (1971-2002) give 7 « 1.5. These values are significantly lower 
than those reported in [1,2,3]. This discrepancy may be due to the fact that 
we don't preprocess the data with a high-pass filter. Calculations with the 37 
stocks in DJIA give similar results, as illustrated in Fig. 4. 
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Fig. 4. The dependence of optimal investment horizon r* with respect to p of daily 
prices of 37 stocks that are or were components of DJIA since 1962. The three lines 
with clear cutoffs at large p correspond to stock prices with relatively short sizes. 

For the 40 indexes in the stock markets throughout the world that we studied 
in the previous section, 22 stock markets have an exponent 7 ~ 1.50 — 1.55. 
The rest 18 indexes have different 7 values. The Taiwan stock market index 
gives a higher scaling exponent 7 = 1.64, while the others have smaller 7 
values (Belgium 1.40, Brazil 1.42, Chile 1.20, Czech 1.42, Egypt 1.20, Indone- 
sia 1.35, Israel 1.40, Korea 1.39, Malaysia 1.36, New Zealand 1.40, Pakistan 
1.37, Philippines 1.29, Russia 1.37, Singapore 1.38, Sri Lanka 1.39, Turkey 
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1.33, Venezuela 1.0). With only a few exceptions, we see that most of the 22 
stock markets having 7 w 1.50 — 1.55 are developed markets, while most of 18 
remaining indexes are emergent markets. In addition, Fig. 5 shows the depen- 
dence on p of r* of 58 time series in the Chinese stock market. The value of 7 
is around 1.25. Therefore, the exponent 7 is a kind of measure of the maturity 
of stock market in some sense. 




Fig. 5. The dependence of optimal investment horizon r* with respect to p of daily 
prices of the Shanghai Stock Exchange Composite, A Share Index, B Share Index, 
and 55 stocks listed in Shanghai Stock Exchange and Shenzhen Stock Exchange 
before 1992/12/01. 
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Fig. 6. The dependence of optimal investment horizon r* with respect to p of 
high-frequency (5 minutes) prices of the Shanghai Stock Exchange Composite and 
7 stocks from 2002/07/01 (or 2002/10/09, or 2003/07/25) in China. 

In Fig. 6 shows the dependence of optimal investment horizon r* with respect 
to p of high-frequency (5 minutes) prices of the Shanghai Stock Exchange 
Composite and 7 stocks in China. It is interesting to notice that the exponents 
of the six high-frequency time series in the Chinese stock markets are close 
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to 1.50. The CAC 40 and S&P 500 high-frequency data have 7 « 1.50 as 
expected. 



4 Does the size matter? 

In Section 3, we have seen that the 7 values of most of the emerging markets 
are less than 1.4 and most of indexes or stock prices with 7 < 1.4 are from 
emerging markets. For an emerging market, the sizes of its recorded index 
and stock prices are usually short. In China, for instance, the first market 
for government-approved securities was founded in Shanghai on November 
26, 1990 and started operating on December 19 of the same year under the 
name of the Shanghai Stock Exchange, and shortly after, the Shenzhen Stock 
Exchange was established on December 1, 1990 and started its operations on 
July 3, 1991 [24]. On the contrary, most of the major western stock markets 
have much longer histories. It is thus of great importance to check whether 
such a significant difference in scaling behaviors of the optimal investment 
horizon is due to the short size of the recorded time series in the emerging 
markets. 

To understand this idiosyncratic scaling behavior in the emerging stock mar- 
kets, we take the daily DJIA index from 1896 to 2004 as a proxy. This time 
series has 29493 data points. We run a moving window of size 3500 (about 14 
years) along the evolution of DJIA with a step of 20 trading days. This gives 
1300 moving windows. For each window, we determine its exit times of all 
points separately. In other words, each running window is treated as an inde- 
pendent time series. Then the optimal in vestment horizon r* is determined for 
each window at 21 threshold levels logarithmically spaced in [0.0194,0.3237]. 
To estimate the value of 7 for each window, we require a linear regression co- 
efficient larger than 0.995. If this constraint is not fulfilled, we remove the last 
point and regress the remaining data again. This iterative procedure repeats 
again and again until the linear regression coefficient is larger than 0.995. For 
most of the data sets, this constraint satisfies with all the 21 points. 

The probability distribution 22(7) of the estimated 7's is plotted in Fig. 7 with 
solid line. The maximum locates at 7 max = 1.51. The percentage of the win- 
dows with 7 < 1.4 is 13.85%. When the Chinese stock markets are concerned, 
only 22 windows have 7 < 1.3, implying a percentage of 1.69%. This result 
already supports the point that there is little chance that the idiosyncratic 
scaling behavior in the Chinese stock markets could not be attributed to its 
short history and this property is thus of little probability to be artificial. This 
conclusion is still significant although less strong in other emergent markets. 

To further enhance our point, we have taken a closer look at the 180 regressions 
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Fig. 7. Probability distribution of the estimated scaling exponent 7. The probability 
distributions are estimated with Gaussian kernel smoothing approach. 

where 7 < 1.4. Figure 8 shows the dependence of r* against p for all the 180 
windows. There are three clusters in Fig. 8. The lines in the bottom cluster 
exhibit a sudden jump around p = 0.08. The left parts with relatively small 
p of all lines are linear with a slope of 1.42. The right parts show slightly 
downward bending. If we fit the points in the left part for each line, we find 
that the percentage of running windows with 7 < 1.4 reduces dramatically to 
2.15%, while the number of windows with 7 < 1.3 is null. 




Fig. 8. Dependence of T* p against p for the 180 windows where 7 < 1.4. 

In Fig. 7, one observes that there are also quite a few windows whose 7 > 1.6. 
The number of these windows is 214, a percentage of 16.46%. Figure 9 shows 
the dependence of r* against p for all the 214 windows. For large p, the linearity 
deteriorates remarkably. The slopes of these lines are comparable to 1.65. If 
we redo the regressions after excluding the points at large p, the percentage of 



9 



windows with 7 > 1.6 reduces to 12.31%. The probability distribution ^(7) of 
the modified 7's is plotted in Fig. 7 with dashed line. The maximum locates 
at 7 m ax = 1-52. The kurtosis excess of the distribution in dashed line is 4.56, 
higher than that in solid line with a value of 3.50. 
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Fig. 9. Dependence of r* against p for the 180 windows where 7 > 1.6. 

At last, let us average r* over the 1300 running windows for each threshold p. 
A perfect power law is observed as follows 

ln(7-;) = 1.56 lnp + 7.86 . (6) 

The linear regression coefficient is 0.9997. The power-law exponent 1.56 is 
slightly larger than the 7 max due to the fact that the skewness of the two 
distributions shown in Fig. 7 are 0.46 and 0.96, respectively. 



5 Concluding remarks 

There are universal and idiosyncratic behaviors in the developed and emerging 
stock markets, such as the log-periodic power-law antibubble pattern [25] or 
return anomalies [26] in China's stock markets. In this paper, we have verified 
that the probability density of exit time follows a universal power law with 
exponent a = 1.5 for large r p for daily indexes and stock prices and 5-min 
high-frequency data in both the developed and emerging markets. However, 
this statistical property can not be extrapolated to other financial markets, 
at least not to the tick-by-tick data in the foreign exchange markets where 
the power law exponent a ~ 2.4 is remarkably greater than the one in stock 
markets [11]. 

On the other hand, although the optimal investment horizon r* scales as 
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p 7 for all stock markets investigated, they have quite different values of 7. 
Roughly speaking, the 7 values of the daily data in emerging stock markets 
are significantly less than those in major western markets (7 ~ 1.5). We have 
showed that this discrepancy in 7 couldn't stems from the difference of record 
sizes in the two kinds of stock markets. In words, for a given gain threshold, 
the optimal investment horizon in an emerging market is shorter than that in 
a developed market. This is consistent with the known result that the optimal 
trading time lag is about 30 days in China's stock markets, much shorter than 
that in the USA markets, due to the speculative nature of emerging markets 
[25]. 

For 5-min high-frequency data (CAC 40 index, S&P 500 index, and SSEC and 
stock prices in China) we find that 7 ~ 1.5. This intriguing feature can be 
interpreted by the t+1 trading rule in the Chinese stock markets which reduces 
the speculation at high-frequency levels. As a cautionary note, we notice that 
the 1-min high-frequency data of NASDAQ do not follow the simple power-law 
relation in the exit time distribution, which calls for a further investigation. 
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